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METHODS FOR SEPARATING AND IDENTIFYING DNA MOLECULES 

The invention relates to methods for separating and 
identifying DNA molecules in mixtures of DNA molecules 
5 having the same number of nucleotides but different base 
sequences . 

1 . Background to the Invention 

10 1 . 1 General Introduction 

Genes exist as multiple alleles which differ from each 
other by small differences in sequence. Individuals are 
often heterozygous with respect to the alleles of 
15 particular genes; i.e. individuals often have two different 
alleles of the same gene. 

In some circumstances, it is desirable to separate the 
alleles of a gene from a mixture of the alleles. For 

2 0 example, when it is desired to carry out a test to 
determine which alleles of a gene are carried by a 
heterozygous individual, it is often necessary to separate 
the two alleles before carrying out the test because the 
presence of two alleles in one test can prevent meaningful 

25 results from being obtained. 

In view of the fact that the difference between the alleles 
of a gene can be as little as one nucleotide, it is often 
difficult to separate the alleles from a mixture of the 
30 alleles. These difficulties are increased in genes which 
have a very large number of different alleles, such as the 
major histocompatibility complex (MHC) genes (e.g. the 
human leucocyte antigen (HLA) class I genes which have 222 
known alleles) . 

35 

HLA matching between a bone marrow or kidney recipient and 
donor is one of the major factors influencing transplant 
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success. Up to date the most favourable bone marrow 
transplant (BMT) and kidney transplant results have been 
obtained using sibling donors who are genotypically HLA- 
identical to the recipient but such donors are available 
5 for only about 30% of patients (1 " s) . BMT using unrelated 

donors can be successful, but these transplants have higher 
rates of graft failure, increased incidence and severity of 
Graft versus Host Disease and more frequent complications 
related to delayed or inadequate immune reconstitution (4) . 

10 

New molecular biological methods for detection of genetic 
polymorphism currently provide an opportunity to improve 
e.g. HLA matching of unrelated donors as well as a research 
tool to investigate the relationship between disparity and 
15 transplant complications. These molecular typing methods 

include sequence-specific amplification, hybridisation with 
oligonucleotide probes, heteroduplex analysis, single 
strand conformation polymorphism and direct nucleotide 
sequencing . 

20 

Each of these molecular approaches has been used for 
routine HLA class II typing (6) , but a variety of reasons 
related to the HLA class I gene structure has complicated 
their application to class I typing. The reasons for these 
25 limitations are the extensive polymorphism of each class I 
locus and the degree of sequence homology between the loci. 
In addition, sequence homology between class I classical 
and non- classical genes and the reported 12 pseudo genes 
can cause problems for specific locus amplif ication (7) . 

30 

The low extent of "allele specific" sequences at 
polymorphic sites is a feature of the HLA class I genes 
that has limited the resolution of all current DNA typing 
approaches. An "allele specific" sequence is a sequence 
35 that is only present in one allele and can therefore be 
used to distinguish the allele from other alleles. 




- 3 - 

The main problem which complicates the identification of an 
allele is the presence of a mixture of alleles, as well as 
contamination by segments of DNA which have homology to the 
allele it is wished to identify and which are co-amplified 
5 in PCR. Current typing methods are sometimes unable to 
resolve the allele it is wished to identify from 
contaminating DNA fragments. Separation techniques such as 
single strand conformation polymorphism (SSCP) can only 
partially resolve this problem, 

10 

1 . 2 Sequence specific primer amplification (PCR-SSP) 

This method utilises both the group- specific and, when 
present, allele-specif ic sequence sites in PCR primer 

15 design. Although each SSP may not individually define the 
required specificity, the use of combinations of sequence- 
specific primers allows the amplification of their common 
sequences to give the desired HLA specificity. The SSP 
design is based on the amplification refractory mutation 

20 system (ARMS), in which a mismatch at the 3' residue of the 
primer inhibits non-specific amplif icat ion {8 ' 9) . 

However, despite its high accuracy, PCR-SSP is only in some 
cases more informative than serology. The reason for this 

25 is the low extent of allele specific polymorphic sites in 
the exon sequences and this limitation has produced a vast 
amount of research into the identification of allele 
specific motifs localised in the intron sequences' 10 *. 
However, up to date this information has not contributed 

30 considerably to the identification of more alleles. 

Another limitation of this method is that it detects a 
limited number of polymorphic sequences which are utilised 
to predict the entire sequence. If an unknown allele is 
3 5 present in a particular sample this extrapolation may be 
incorrect . 




- 4 - 

1 . 3 Single strand conformation polymorphism (SSCP) 

This technique is based on the electrophoretic mobility of 
single stranded nucleic acids in a non-denaturing 
polyacrylamide gel, which depends mainly on sequence- 
5 related conformation 111 " 135 . The technique can be employed 
for isolating single alleles which could then be used for 
further manipulation and analysis such as direct 
sequencing. The pattern of bands obtained after 
electrophoresis may be diagnostic for an allele. Although 
10 the technique is easy to perform, the specificity and 

sensitivity are poor and its use has been limited to only 
few HLA class I subtypes < 14 ' 151 . 

The successful use of the technique relies on group 
15 specific amplification and therefore prior knowledge of 
broad HLA specificity is needed. 

1 . 4 Heteroduplex analysis 

2 0 Fully matched DNA duplexes are more stable than those with 

base mismatches. Instability of the duplex increases with 
the number of nucleotide mismatches which produce an 
increasing "drag effect" in polyacrylamide gels which 
retard the migrating bands (16_18) . 

25 

This technique exploits the formation of mismatched DNA 
hybrids (heteroduplex) which are formed at the end of each 
PCR cycle between coamplified alleles from a particular 
locus or loci. During the annealing stage of each cycle of 

3 0 the PCR, a proportion of sense strands of each allele may 

anneal to anti-sense strands of different alleles. A band 
pattern is obtained in PAGE analysis which can be useful 
for identifying the alleles involved in the reaction 16 ' 19 ' 211 . 

35 Heteroduplex analysis is an approach that has been utilised 
to compare HLA genes of a particular donor and recipient. 
HLA genes are amplified, denatured (melted into single 
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strands) and mixed together under conditions that promote 
renaturation to form double stranded molecules. If the HLA 
genes of a donor and recipient are similar but not 
identical, heteroduplexes will form consisting of one 
5 strand of an allele of donor origin and a second strand 
from a different allele of recipient origin {22,23) . 
Heteroduplex molecules can be detected by altered 
electrophoretic migration. The sensitivity of this method 
can be increased by adding DNA from an HLA allele that is 
10 not present in the donor or recipient. 

The major advantage of heteroduplex analysis is that it is 
relatively easy and inexpensive. Limitations of this 
approach include inability to detect certain HLA 
15 disparities, potential detection of irrelevant silent 

mutations and lack of specific information regarding the 
nature of the alleles involved. 

Up to date this approach has been used only for HLA class 
20 II typing with limited success. It has not been applied to 
HLA class I analysis. 

1 . 5 Sequence specific oligonucleotide probes (PCR-SSO) 

25 SSO typing involves amplification of HLA alleles from a 

particular locus followed by hybridisation with a panel of 
oligonucleotide probes to detect polymorphic sequences that 
distinguish one allele or group of alleles from all others. 
This is sometimes referred to as low resolution or generic 

30 oligotyping. Following low resolution typing, selected 

primers can be used to achieve amplification of individual 
alleles which are then identified by specific probes. This 
second stage of oligotyping is often referred to as high 
resolution oligotyping* 61 . 

35 

The major methodological drawback of this approach is that 
the complexity of the technique is directly related to the 
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number of alleles under investigation and the presence of 
two alleles in the heterozygous condition can complicate 
the identification process. 

5 Based on PCR-SSO several typing approaches for HLA-A and B 
have been published and the number of probes that they 
require are over 40 and 90 probes respectively (24 ' 25) . This 
leads to a highly complex operation which is time consuming 
and the resolution obtained is only moderate. 

10 

Published oligotyping methods could result in incorrect 
interpretation of data if certain combinations of recently 
discovered alleles are present in a specimen t26) . This 
limitation and the great number of probes needed have led 
15 several groups to explore the feasibility of automated 
sequencing for routine HLA class I typing. 

The advantages of PCR-SSO methods are specificity, 
sensitivity, simplicity, reproducibility, inexpensive and 
20 that they allow simultaneous processing of many samples. 

1 . 6 Direct Nucleotide Sequencing 

This technique involves PCR amplification using one 
25 biotinylated and one non-biotinylated primer. Following 
amplification, PCR products are captured by streptavidin- 
coated magnetic beads and the non-biotinylated strand is 
removed, this permits unidirectional solid-phase sequencing 
of purified single-stranded DNA (6) . 

30 

Sequencing methods can be differentiated by whether the 
template has been cloned or produced directly from genomic 
DNA by an initial PCR amplification. Sequencing approaches 
can also employ either manual or automated techniques (14> . 
3 5 Cloned templates and templates produced from allele - 

specific PCR amplification represent a single sequence 
derived from one HLA haplotype. In contrast, alleles from 
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both HLA haplotype in heterozygous samples may be co- 
amplified and sequenced together using locus-specific PCR 
primer. The recent availability of software, which allows 
the user to align the derived sequence against established 
5 libraries, has facilitated the analysis and allele 
assignments for heterozygous samples in which both 
templates are sequenced at the same time {27) . The 
effectiveness of this method depends on the amount and 
frequency of ambiguous heterozygous combinations which for 
10 HLA class II are not many, but for class I the number of 
ambiguous combinations is so high that the obtained 
resolution is comparable with serology. 

Up to date two HLA class I typing approaches based on 
15 direct sequencing have been published, both require 
serology information followed by allele specific PCR 
amplification and then direct sequencing* 14,27 ' . Although 
this approach gives the highest resolution, it is expensive 
and difficult to perform routinely. 

20 

2 . Summary of the Invention 

The invention provides methods for separating and 
identifying a DNA molecule in a mixture of DNA molecules 
25 having the same number of neucleotides but different base 
sequences. The separation methods comprise 

(i) amplifying the DNA molecules in the mixture ; 

3 0 (ii) hybridising single strands of the amplified DNA 

molecules with a complementary strand of a 
reference DNA molecule so as to form duplexes; 
and 

3 5 (iii) separating the duplexes. 

The different DNA molecules in the original mixture give 



# • 



- 8 - 

rise to duplexes having different numbers, positions or 
types of mismatches. This allows the duplexes to be 
separated by, for example, gel electrophoresis. The 
separated duplexes can then be analysed to identify the DNA 
5 molecules that were present in the original mixture. One 
embodiment of a method of the invention is illustrated in 
Figure 1 . 

The method of the invention can be used directly as a 
10 diagnostic technique to identify a DNA molecule by the use 
of a specific reference DNA molecule. The formation of a 
homoduplex identifies a DNA molecule in the unknown mixture 
as identical to the reference DNA molecule. The formation 
of a heteroduplex may also be used to identify an unknown 
15 DNA molecule by using a known heteroduplex as a control. 
The invention includes a method for identifying a DNA 
molecule, which method comprises (i) contacting the DNA 
molecule with a labelled reference DNA strand under 
conditions such that the reference stand hybridizes to a 
20 complementary strand of the DNA molecule so as to form a 
test duplex; (ii) running the test duplex and one or more 
control duplex (es) in a gel by electrophoresis; and (iii) 
comparing the position of the test duplex on the gel with 
the position (s) of the control duplex (es). 

25 

The method of the invention can also be used as a 
separation technique for separating the alleles in a 
mixture of unknown alleles of a polyallelic gene, such as 
the mammalian MHC genes (e.g. the HLA genes) . Duplexes 
3 0 formed between the unknown alleles and a reference allele 
are separated so as to isolate the unknown alleles for 
identification by techniques such as DNA sequencing, SSP 
and SSO. 

3 5 In one embodiment, the invention provides a method which 
comprises 



(i) amplifying the DNA molecules in the mixture 
employing a pair of primers in which one of the 
primers carries a ligand, so as to produce an 
amplified mixture of double -stranded DNA 
molecules in which one of the strands carries a 
ligand; 

(ii) contacting the amplified mixture of double- 
stranded DNA molecules with a receptor on a solid 
support under conditions such that the ligand 
binds to the receptor ; 

(iii) separating the mixture of double -stranded DNA 
molecules into single -strands and removing the 
strands that are not bound to the support by 
the ligand; 

(iv) recovering the remaining strands from the 
support ; 

(v) mixing the recovered strands with a complementary 
strand of a reference DNA molecule so as to form 
duplexes; and 

(vi) separating the duplexes. 



The complementary strand of the reference DNA molecule may 
be provided by essentially the same technique as the 
technique set out above in steps (i) to (iv) for providing 
the mixture of DNA molecules in single -standard form. In 
particular, the reference complementary strand DNA molecule 
may be provided by 

(i) amplifying the reference DNA molecule employing 
a pair of primers in which one of the primers 
carries a ligand, so as to produce amplified 
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double -stranded reference DNA molecule in which 
one of the strands carries a ligand; 

(ii) contacting the double -stranded reference DNA 

5 molecule with a receptor on a solid support under 

conditions such that the ligand binds to the 
receptor; 

(iii) separating the double -stranded reference DNA 
10 molecule into single-strands and removing the 

strand that is not bound to the support by the 
ligand; and 



(iv) recovering the remaining strand from the support. 



15 



Ih another embodiment of the invention, there is provided a 
method which comprises 

2 0 (i) amplifying the DNA molecules in the mixture 

employing a pair of primers in which one of the 
primers carries a high molecular weight molecule, 
so as to produce an amplified mixture of double - 
stranded DNA molecules in which one of the 

25 strands carries a high molecular weight molecule; 

(ii) separating the mixture of double -stranded DNA 
molecules into single strands; 

30 (iii) mixing the single strands with a complementary 

strand of a reference DNA molecule so as to form 
duplexes ; and 



(iv) separating the duplexes. 



35 



The complementary strand of the reference DNA molecule may 



be provided by 



(i) amplifying the reference DNA molecule employing a 
pair of primers in which one of the primers 
carries a high molecular weight molecule, so as 
to produce an amplified double -stranded reference 
DNA molecule in which one of the strands carries 
a high molecular weight molecule; and 

(ii) separating the double -stranded reference DNA 
molecule into single strands. 

This embodiment overcomes the need for solid support 
systems by conjugating one primer of a pair of primers 
directly to a high molecular weight molecule (e.g. a 
protein) , for the reference and test systems. The 
amplified product after hybridisation can be applied 
directly to a separating gel. The high molecular weight 
conjugates are retained in the gel compared to the duplex 
without attachment of the high molecular weight molecule. 

In a further embodiment of the invention, there is provided 
a method which comprises 

(i) amplifying a single strand of each of the DNA 
molecules in the mixture; 

(ii) mixing the amplified single strands with a 
complementary strand of a reference DNA molecule 
so as to form duplexes; and 

(iii) separating the duplexes. 

In this embodiment, the complementary strand of the 
reference DNA molecule may be provided by amplifying a 
single strand of the reference DNA molecule. The 
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amplification of the single strand of the reference or test 
DNA molecule can be done, for example, by asymmetric PCR. 

This embodiment overcomes the need for both solid support 
5 systems and conjugation of one primer of a pair to a high 
molecular weight molecule. However, in the embodiment it 
is possible to use a primer carrying a ligand such as a 
hapten in order to facilitate capture of the amplified 
strand with a receptor such as an antibody and separation 
10 of the amplified strand from other components in the 
amplification mixture. 

After separation of the DNA molecules by one of the above 
methods, the molecules present in the mixture may be 
15 identified by carrying out one or more of the following 
steps : 

(i) comparing the positions of the separated duplexes 
on the gel with the position of a control DNA; 



20 



25 



(ii) sequencing each of the separated molecules ; 

( iii) sequence specific primer (SSP) amplification 
analysis; and 

(iv) sequence specific oligonucleotide (SSO) analysis 



The invention provides an improvement over prior methods 
for separating DNA molecules. The advantages offered by 
3 0 the invention can be summarised as follows: 

(a) The invention provides a high resolution between 
different DNA molecules and differences of as 
little as one nucleotide between molecules can be 

35 detected. 

(b) The invention allows simultaneous and rapid 



processing of a large number of samples. 



(c) The invention is comparatively inexpensive to 
perform, particularly when compared to prior 
methods which achieve a high level of resolution. 

(d) The invention uses techniques that can be 
performed easily without recourse to complex and 
expensive technology - 

3 . Principle underlying the invention 

Fully matched DNA duplexes are more stable than those with 
base mismatches. Regions of nucleotide sequence 
complementary are double stranded, but mismatched regions 
form single -stranded loops along the length of the DNA 
molecule. The number, size, composition and position of 
the single-stranded loops vary for each combination of 
alleles. The rate at which the DNA migrates in 
polyacrylamide or special agarose gels depends on both 
molecular conformation and molecular weight . Heteroduplex 
DNA always migrates more slowly than the corresponding 
homoduplex DNA. Both denaturing reagents and/or heat 
enhance the degree of separation of the mismatched DNA 
strands (heteroduplexes) from the corresponding matched 
duplexes (homoduplexes ) . 

As the molecular conformation of heteroduplexes can be 
manipulated by hybridisation of a known single strand 
reference with unknown complementary single strand(s), it 
is proposed that heteroduplexes can be separated from each 
other by e.g. denaturing or non-denaturing polyacrylamide 
electrophoretic analysis. This allows the separation of 
the two amplified alleles from a particular locus for 
further analysis. In addition, the method of the invention 
(which we call "Complementary Strands Analysis" (CSA) ) , 
permits assessment of the quality of the PGR product before 
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the process of identification is carried out. CSA is able 
to identify the presence of coamplified non-desirable 
alleles from different loci and, potentially, PCR fragments 
that contain artifacts such as Taq errors and in vitro 
5 recombinations. 

In addition, CSA itself can be used as a diagnostic 
technique. It can identify alleles by hybridisation of 
allele specific single strand with unknown complementary 

10 single strand (s) followed by e.g polyacrylamide gel 
electrophoretic analysis, with or without denaturing 
conditions and/or with or without a temperature gradient. 
The formation of a homoduplex demonstrates identity between 
at least one of the unknown alleles and the allele specific 

15 reference, and non- identity between them produces 
heteroduplex (es) . 

4 . Detailed Description of the Invention 

2 0 The kinds of DNA molecule that may be separated and 

identified by the methods of the invention include alleles 
of polyallelic genes, segments of genes and non-expressed 
fragments . 

25 Examples of genes with multiple alleles to which the 

invention may be applied are the mammalian MHC genes such 
as the HLA class I and class II genes, the T cell receptor 
genes in mammals < 30 ' 31) , TAP, LMP, ras (29> , non classical HLA 
class I genes, the genes for human complement factors C4 

30 and C2 , Bf in the human HLA complex, and genes located in 
mitochondrial DNA, bacterial chromosomes and viral DNA. 
The invention can be used in the analysis and 
identification of mutations (e.g. point mutations) in these 
and other genes and chromosomal aberrations such as 

35 translocations, deletions and inversions. 



There are three different genes within the HLA class I 



m m 
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group of genes, namely HLA-A, HLA-B and HLA-C, and each of 
these three genes exists in the form of multiple alleles. 
There are a total of about 222 known alleles of the HLA-A, 
HLA-B and HLA-C genes and the sequences of known alleles 
5 are set out in Arnett and Parham (1995) Tissue Antigens 4_5 
217-257. There are also multiple genes within the HLA 
class II group of genes, known as DR, DQ and DP. 

In the method of the invention, it is necessary to identify 
10 primer sequences unique for the target gene so as to 

include all polymorphic sites of interest in the amplified 
fragment, which should also be manageable in length. For 
example, the polymorphic sites in exons 2 and 3 of HLA 
class I would facilitate the identification of all 
15 recognised alleles of HLA-A, B and C, with 5 exceptions, if 
locus-specific amplification is achieved. Therefore, the 
primers used in the invention may, for example, be selected 
so as to specifically amplify exons 2 and 3 of each of HLA- 
A, HLA-B and HLA-C separately. Cereb (28} and. collaborators 

2 0 have described primer sequences located in the first and 

third exons which can be used for locus-specific 
amplification of the entire exon 2 and 3 region of each of 
the HLA-A, HLA-B and HLA-C genes. The sequences of 
suitable primers are given in the Example below. 

25 

The reference DNA molecule used in the invention generally 
has a known sequence . The reference may be chosen so as to 
have a similar allotype to an allotype that at least one of 
the test alleles is suspected of having. For example, it 

3 0 may be known that a test allele is of the HLA -AO 2 type from 

serological data, but it may not be known which of the 
seventeen A02 sub-types the allele is. In this case, the 
reference allele may be chosen to be of sub-type A0201 and 
the method of the present invention could then be used to 
35 determine which of the A02 sub-types the test allele is. 

The reference strand may be obtained from (a) a homozygous 
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source, (b) a heterozygous source from which individual 
strands are isolated by gel separation after amplification 
steps or (c) DNA synthesis. There are now about 5 00 
internationally recognised cell lines which contain HLA 
5 alleles of known sub-type and these cell lines can be used 
as a source of reference alleles. 

The initial broad typing of the test allele may be 
performed using serological techniques. Different alleles 

10 of HLA induce different types of antibody and the antibody 
induced by an unknown HLA therefore gives information as to 
the allotype of the HLA. However, serological typing does 
not achieve a high level of resolution and is not 
sufficiently accurate or reproducible for matching a 

15 prospective donor with a prospective recipient in a tissue 
or organ transplant operation. 

The control DNA used in the method of the invention may be 
a homoduplex between two strands of the same DNA molecule 

20 (e.g. the reference DNA molecule), so that migration of a 

test duplex to the same position on the electrophoretic gel 
as the control homoduplex indicates that the test duplex is 
a homoduplex. If the test duplex is a homoduplex, it can 
be concluded that the unknown DNA molecule is the same as 

25 the reference molecule. 

Control DNAs may be obtained by simply amplifying a known 
DNA molecule using the same primers as used in the method 
of the invention to amplify the reference and unknown 
30 molecules. 

The control DNA may also be a heteroduplex of known DNA 
molecules. This allows the method of the invention to be 
used to identify molecules in heteroduplexes formed by test 
35 samples. The same heteroduplexes from different sources 
migrate to the same position on a gel. 
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A potential problem with identifying molecules in 
heteroduplexes is that certain different heteroduplexes 
containing the same number and type of mismatches migrate 
to the same position. However, this problem can be 
5 overcome. For example, in typing of the HLA class I 
alleles, the different duplexes could be identified by 
their different sizes (the HLA-A, HLA-B and HLA-C genes are 
different sizes) or by amplifying each of the HLA-A, HLA-B 
and HLA-C genes with primers carrying different labels. 

10 Each locus specific heteroduplex would have a different 
size or carry a different label, and could be 
electrophoresed simultaneously in the same track of a gel-. 
The duplexes could then be identified by comparing them to 
control duplexes in the same gel. Examples of suitable '~ 

15 labels include radiolabels, colour labels and fluorescent 
labels. 

The mixture of alleles used in the method of the invention 
may be from a prospective donor or a prospective recipient 
20 in a tissue or organ transplant operation. The results of 
the method may therefore be used to match a prospective 
recipient with a prospective donor. 

In one embodiment of the invention, the alleles of the 
25 prospective donor or of the prospective recipient are in 
effect used as reference alleles and duplexes are formed 
between strands of the prospective recipient's alleles and 
of the prospective donor's alleles. Analysis of the 
duplexes formed between the strands from the prospective 
3 0 recipient and donor reveals whether they have compatible 

alleles. Thus, in one embodiment, the invention provides a 
method for determining whether a prospective recipient in a 
tissue or organ transplant operation has alleles of a gene 
that are compatible with the alleles of a prospective donor 
35 in the operation, which method comprises 



(i) amplifying the alleles of the prospective 
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recipient employing a pair of primers in which 
one of the primers carries a ligand, so as to 
produce amplified double- stranded alleles of the 
prospective recipient in which one of the strands 
carries a ligand; 

(ii) contacting the amplified double -stranded alleles 
with a receptor on a solid support under 
conditions such that the ligand binds to the 
receptor; 

(iii) separating the double -stranded alleles into 
single -strands and removing the strands that are 
not bound to the support by the ligand; 

(iv) recovering the remaining strands from the 
support ; 

(v) mixing the recovered strands with complementary 
strands of the alleles of the prospective donor 
so as to form test duplexes; 

(vi) separating the test duplexes by gel 
electrophoresis; and carrying out one or more of 
the following steps: 

(vii) comparing the positions to which the test 
duplexes migrate on the gel with the position of 
a control DNA; 

(viii) sequencing one or both strands of each of the 
test duplexes; 

(ix) sequence specific primer (SSP) amplification 
analysis; and 



(x) sequence specific oligonucleotide (SSO) analysis. 
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Other proposed uses of the invention include determination 
of the paternity of an individual by identifying one (or 
more) of his alleles to see if it is the same as a 
corresponding allele of a potential father. The invention 
5 may also be used in forensic medicine to determine the 

origin of a sample of body tissue or fluid, as a follow up 
technique in treatment of haematological malignancies or 
inherited disorders, in adoptive immunotherapy, and in 
identification of bacteria and viruses. 

10 

In the method of the invention, the amplification steps may 
be carried out by polymerase chain reaction (PCR) . 

The ligand/receptor system used in the invention may, for 
15 example, be the biotin/streptavidin system or a 

hapten/antibody system. Direct conjugation of the primer 
via a linking group, such as short poly A, to the beads is 
an alternative. When the biotin/streptavidin system is 
used, one of the primers used in each of the amplification 
20 steps may be labelled with biotin, so that when the 

amplification reaction is carried out double -stranded DNA 
is produced in which one strand carries a biotin label. 
The double -stranded DNA may then be bound to a solid 
support coated with streptavidin. 

25 

The solid support used in the invention is typically 
magnetic beads. However, other supports may be used, such 
as the matrix of an affinity chromatography column. When 
the support is in the form of magnetic beads, the two 

3 0 strands of the amplified DNA are separated by attracting 
the beads to a magnet and washing the beads under 
conditions such that the double-stranded DNA dissociates 
into single-strands. The dissociation is typically 
performed by incubating the beads three times under 

3 5 alkaline conditions (e.g. 0.1 M NaOH) at room temperature 
for about 5 minutes. Usually, the strand which is not 
bound to the support by the ligand is then discarded, 
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although it is equally possible to retain the strand that 
is not bound to the support and discard the strand that is 
bound to the support . 

5 The strand that remains attached to the support may be 

recovered from the support by incubating the support under 
conditions such that the ligand/receptor complex 
dissociates. When the biotin/streptavidin system is used, 
the support is typically heated to e.g. 95 °C for about 5 
10 minutes; this ensures denaturation of the streptavidin 

molecule to release the biotinylated single strand which is 
then recovered. 

At this stage, there have been provided a single-stranded 
15 unknown allele and the complementary strand of a reference 
allele. The two strands are then mixed together under 
conditions in which they hybridise to form duplexes. 
Typically, the hybridisation step is performed by heating 
the mixture of strands at about 95 °C for about 3 min, at 

2 0 about 70 °C for about 5 min and then at about 6 5 °C for about - 

45 min. 

Under these conditions, duplexes are formed which can 
subsequently be separated by gel electrophoresis (e.g. 
25 polyacrylamide gel electrophoresis) . The electrophoresis 
is preferably carried out under denaturing conditions 
because this has the effect of amplifying the loops formed 
by mismatched strands and hence allows better resolution. 

3 0 As an alternative separation technique to gel 

electrophoresis, high pressure liquid chromatography (HPLC) 
may be used. 

In the embodiment of the invention in which one of the pair 
3 5 or primers is conjugated to a high molecular weight 

molecule, the molecule may be a protein such as bovine 
serum albumin (BSA) . The molecular weight of the high 



molecular weight molecule is such that it causes the DNA 
molecule to which it is attached to be sufficiently 
retarded in the separation step (e.g. the electrophoresis 
step) to allow the DNA molecule to be separated from a 
duplex without a high molecular weight compound attached. 
For example, the molecular weight of the high molecular 
weight molecule may be from 10 to 200 kDa, preferably 20 to 
100 kDa. 

In a preferred embodiment of the invention, DNA fragments 
from a particular segment of a genome can be analysed and 
identified with the use of a labelled reference strand 
without the isolation of the duplex from the analysis 
phase. This approach can be automated with the use of 
currently available technology for simultaneous resolution 
and identification of alleles. 

In this diagnostic use of the method, separation of the 
sample DNA fragments into a single strand or the use of 
ligand bearing primers is not necessary. The pre-labelled 
reference strand is mixed with the sample and, following 
the hybridisation step, the mixture is analysed by a method 
which will separate the DNA duplexes (eg electrophoresis or 
high pressure liquid chromatography) . This embodiment is 
referred to as "Double Strand Conformation Polymorphism" 
(DSCP) analysis. 

The selective identification of the marker bearing 
reference-sample duplexes excludes those hybrids which have 
not complexed with the reference strand. The position of 
the identified bands is diagnostic. These can be 
accurately assigned by the inclusion of reference mobility 
markers, which could be (a) internal markers with faster 
and slower mobilities than the duplex under investigation 
or (b) multiple markers that, for electrophoretic runs, 
would be resolved simultansously in a separate track and 
would form a reference ladder of graded mobilities. 
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In order to enhance the resolution of the method and to 
resolve ambiguities in the identification of alleles, more 
than one (e.g. from 2 to 5) reference strands can be used. 
The reference strand may be the complementary strand, 
5 another allelic version or a synthetic version which will 
also be hybridised individually with the DNA segments and 
the resulting duplexes will be analysed simultaneaously but 
separately. 

10 In this embodiment, the DNA sample fragments can be 

obtained by (a) specific amplification, e.g. PCR, or (b) by 
enzymatic digestion of the genome where the number of genes 
are limited (e.g. viral genomes, microbial plasmids, vector 
cassettes or the segments from (a) ) . 

15 

The reference strand, which is partially complementary in 
sequence to the intended sample target, may be prepared by 
introducing specific nucleotides at selected positions 
which enhance the separation of the duplexes during the 

2 0 separation stage and allow the specific amplification of 
the target sequences. The reference strand may be 
synthetic, i.e. man made and not from any natural ly- 
occuring allele. An optimum reference can be designed 
which produces an optimised distribution of bands. A 

25 second reference can be used which improves the resolution 
of any bands which are not well resolved by the first 
reference . 

The reference may be synthesised: 

30 

a. By a combination of specific primers which generate a 
strand with three nucleotide differences at 3' end, which 
would ensure the specificity of the amplification following 
the separation of the sample strands. 



b. By preparing short fragments of target sequence either 
by specific amplification or where necessary synthetically. 
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After specific alterations to the sequences these fragments 
would be ligated to produce a single reference strand. 

c. The reference strand may have a fluorescent ligand 
5 attached at one end for identification or it could carry a 
ligand or compound (e.g. biotin) that would allow 
attachment of an enzyme molecule (e.g. via streptavidin) . 
The reference -sample duplex can then be identified either 
by fluorescence detection methods or by enzyme amplified 
10 methods using e.g. a colourometric or chemiluminescent 
technique . 

This approach is suitable for automated separation and 
detection. In order to automate the analysis, it is 

15 proposed to include mobility markers in the separation 
phase. These labelled marks would be either defined 
segments of genomic DNA prepared by amplification or 
synthetically prepared so as to act as reference points for 
automatic computation of the exact location of the sample 

20 under investigation. 

Use of the invention to analyse ras oncogene point 
mutations : - ras has been implicated in the oncogenesis of 
many tumours and appears to be activated by point 
25 mutations. These mutations can occur in all three ras 

genes (N-ras, Harvey-ras and Kirsten-ras) at codons 12/13 
and 61 with corresponding amino acid substitutions in ras 
proteins (p21) . These point mutations can be detected by 
application of the invention. 

30 

Two pairs of primers are needed, one for the 12/13 codons 
and one for codon 61. The primers described by Lyons (29> 
can be used, with modification by covalent attachment of a 
ligand to one primer of each pair for each of the test 
3 5 fragment and the reference fragment. For example, a ligand 
may be attached to primer Pla (a 12/13 codon primer) and 
primer Plb (a 61 codon primer) for the reference fragment, 
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and to primer P2a (a 12/13 codon primer) and primer P2b (a 
61 codon primer) for the test fragment. In this way, 
complementary ligand- labelled single strands for the 
reference and test fragments are obtained. The 
5 complementary strands are hybridised and subjected to 
electrophoresis. Detection of a homoduplex between the 
test fragment and a mutant reference fragment will indicate 
the presence of the mutation in the test fragment. 

10 Use of the invention to identi fy T cell receptor (TCR) 
rearrangements in T cell tumours an d in adoptive 
immunotherapy : - some T cell tumours can be monoclonal in 
origin and a proportion of the T cells from a patient may 
carry a particular rearrangement of the T cell variable 

15 domain genes alpha/beta or gamma/delta depending on the T 
cell type. The efficacy of a particular treatment or the 
course of the disease can be evaluated by the 
identification of the malignant clone TCR rearrangement. 
The method of the invention with the use of suitable number 

20 of controls can be made semi-quantative , which would allow 
the evaluation of the progress of the treatment or the 
disease . 

In adoptive immunotherapy, a specific rearrangement of the 
2 5 variable domain genes of the TCR can be used as a marker 
for the selected cytotoxic T cell that has been generated 
in-vitro. Post infusion fate of these cells can be 
monitored by a semi-quantative detection of the particular 
rearrangement . 

30 

In both methods the T cell variable domain gene primers 
(30 - 31) can be modified by covalent attachment of a ligand at 
the 5' end of the primer pair, and the reference strand 
will be selected to be complementary to the test DNA 
3 5 fragments. 



The following Examples illustrate the invention. 
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Brief Description of the Drawings 

Fiqure 1 shows a schematic overview of an embodiment of the 
method of the invention. 

5 

Fiqure 2 shows the band pattern formed between the sense 
strand of LOS41265 <A*0101) with the anti-sense strand (s) 
of STEINLIN (A'0101) , WIN ( Al ) , DAUDI (A*0102, A*6601) , BM21 
(Al) and E4181324 (Al) under the following conditions: 8% 
10 polyacrylamide, 2 M urea, 10% formamide, 200 Volts during 6 
hrs at room temperature. 

Fiqure 3 shows duplexes formed between the sense strand of 
JY (A*0201) with the anti-sense strand (s) of M7 (A*0202, 
15 A*0301) , DK1 (A*0203, A*3301) , RML (A*0204), WT49 (A*0205), 

CIA (A*0206, A24), KNE (A*0207, A*0201) , KLO (A*0208, A*0101) 
and OZB (A*0209, A*0301) under the following conditions: 8% 
polyacrylamide, 2 M urea, 10% formamide, 20 0 Volts, 6 hrs, 
at room temperature. 

20 

Fiqure 4 shows the position of a heteroduplex band compared 
to a homoduplex control. The heteroduplex band corresponds 
to the hybridisation between the sense strand of JY 
(A*0201) and the anti-sense strand of RML (A*0204) which 
25 differ from each other ny only one nucleotide. Denaturing 
conditions were: 8% polyacrylamide, 3 M urea, 15% 
formamide, 200 Volts, 6 hrs. at 50°C. 

Fiqure 5 shows a Complementary Strands Analysis PAG/agarose 
30 gel. Following HLA-A locus specific amplification of exons 
2 Sc 3, the antisense DNA strands were isolated and 
hybridised with complementary locus reference strand. 
Samples were applied to PAG/agarose cassette and 
electrophoresis was performed at 230 volts for 6 hrs at 
3 5 room temperature; the gel was stained with SYBR Green I. 
Single bands were obtained from homozygous cell lines and 
double bands from heterozygous lines. Duplexes of 
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identical alleles from different sources have the same 
mobility in the gel; the samples in lanes 5 and 8 are from 
alleles A0101 and A0201, and the samples in lanes 1 and 9 
are from a homozygous line with A0101 which has the same 
5 mobility as the fast lines in lanes 5 and 8. Lane 1: 

STEINLIN (A*0101), Lane 2: KIME (A*0211 - A*3201) , Lane 3: 
DAUDI (A*0102 - A*6601) , Lane 4: EA (A*0301) , Lane 5: LCL721 
(A*0101 - A*0201) , Lane 6: M7 (A*0202 - A0301) , Lane 7: CJO- 
A (A*1101) , Lane 8: T5-1 (A0101 - A*0201) , Lane 9: L0541265 
10 (A*0101) , Lane 10: AM (A*0205 - A3201) . 

Figures 6 and 7 show electrophoretic gels of the duplexes 
formed in Example 2 . 

15 EXAMPLE 1 

1. Methods 

1 . l Summary 

20 

Amplification and isolation of the biotinylated anti-sense 
strand (s) were performed. They were then hybridised with 
allele specific reference sense single strands. 

25 In order to test the resolving power of this method, 7 
allele specific reference sense single strands were 
prepared. These were hybridised with several isolated 
anti-sense strand HLA-A alleles which were selected to 
include alleles with one or several nucleotide differences 

30 compared to the reference strand. 

Following this step, PAGE analysis was performed under 
several different denaturing conditions. Urea at a range of 
2-6 M and formamide between 10%-30% concentrations were 
3 5 used and the gels were run at either room temperature or 
50-58°C (200 Volts for 6 hours) . 
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10 



20 



25 



The identification of a homoduplex in the denaturing PAGE 
gels indicated identity between the allele specific single 
reference strand and at least one of the alleles from the 
unknown DNA s amp 1 e . 

As a positive control, double strand DNA PCR product from 
the allele reference was always used. 

3^2. Locus specific amplification of HLA class I genes 



For typing purposes amplification of exons 2 and 3 is 
desirable, and the primers were therefore selected to 
amplify the stretch of the genome between intron 1 and 
15 intron 3 . The localisation and nucleotide sequences of the 
HLA locus -specific primers used are given in the reagents 
section . 



PCR reactions were performed in a total volume of 100/il 
using 1/xg of genomic DNA and 25 pmoles of each locus - 
specific primer. The 3 ' -primer was biotinylated at 5-' -end. 
This arrangement ensures the incorporation of the 
biotinylated primer onto the amplified antisense DNA 
strand. PCR conditions are given in the following table. 

Thermocvclina conditions 



A and C loci 95 °C 4 min. l cycle 

30 

95°C 30 sec. 

70°C 50 sec. 33 cycles 

72°C 30 sec. 



35 



72°C 



8 min. 



1 cycle 



B locus 



95°C 
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4 min . 



1 cycle 



95°C 



30 sec. 



65°C 



50 sec 



33 cycles 



72°C 



30 sec 



72°C 



8 min. 



1 cycle 



1.3 Separation of the amplified DN A strands 
1.3a Removal of non-biot inylat ed strand: 

Magnetic beads with covalently coupled streptavidin on the 
surface were added to the PCR product and incubated for 3 0 
minutes at 43 °C. In this way the amplified PCR product was 
immobilised by the interaction of biotin and streptavidin. 
After incubation, the tubes were placed against a magnet 
and the beads were washed with washing buffer to remove the 
remaining PCR reaction components. 

The non-biotinylated DNA strand was then dissociated from 
the beads by incubation with 0.1 M NaOH at room temperature 
(r.t.) for 5 minutes <X3) . Following this the beads were 
washed to remove excess NaOH and resuspended in 50 jzl of 
hybridisation buffer. 
5 

1 .3b Removal of biotinylated DNA strand: 

The bead suspension was heated at 95 °C for 5 minutes; this 
ensures denaturation of the streptavidin molecule to 
0 release the biotinylated amplified anti-sense single strand 
which was then removed and placed in a clean tube. 
At this stage, the isolates contained single biotinylated 
DNA strands from each allele. 

5 1.4 Preparation of allele specific reference single- 
stranded DNA 



10 
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DNA was extracted from 10th IHW cell lines. The following 
homozygous cell lines were selected as allele specific 
reference DNA: JY (A*0201) , RML (A*0204), WT49 (A*0205), 
T7527 (A*0206) , KRC-005 (A*0212), AMALA (A*0217) and 
L0541265 (A # 0101) . 

The PCR conditions for amplification were as above, with 
the exception that in each case the locus-specific 5'- 
primer was biotinylated (5' -end). The PCR products were 
analysed by PAGE to assess the fidelity of the 
amplification and in all cases a single band was obtained. 

1 . 5 Hybridisation 

15 The biotinylated anti-sense strand (s) from above were mixed 
with the sense strands, and the mixture was heated at 95°C 
for 3 min., incubated at 70°C for 5 min., and then at 65°C 
for 45 min. Under these conditions, the sense and anti- 
sense strands were hybridised. The heteroduplexes formed 

2 0 could subsequently be separated from each other by 
electrophoresis in polyacrylamide gel. 

2 . Reagents : 

25 A) Nucleotide sequences of primers used for locus-specific 
amplification: 

5' A locus primer: GAA ACG/C GCC TCT GT/CG GGG AGA AGC AA 
(Intron 1: 21-46) 

30 

3' A locus primer: TGT TGG TCC CAA TTG TCT CCC CTC 
(Intron 3 : 66-89) 

5' B locus primer: GGG AGG AGC GAG GGG ACC G/CCA G 
35 (Intron 1: 36-57) 



B locus primer: 



GGA GGC CAT CCC CGG CGA CCT AT 



30 - 



(Intron 3 : 37-59) 



5' C locus primer: AGC GAG GG/TG CCC GCC CGG CGA 
(Intron 1: 42- 61) 

3' C locus primer: GGA GAT GGG GAA GGC TCC CCA CT 
(Intron 3: 12-35) 



B) Buffers: 

Washing buffer: 



Hybridisation buffer: 



PGR buffer: 



TE buffer 



10 mM Tris-HCl pH 7 . 5 

1.0 mM EDTA 

2.0 M NaCl 

20 mM Tris-HCl pH 8.4 

50 mM KC1 

20 mM Tris-HCl pH 8.4 

50 mM KC1 

0.2 mM MgCl2 

10 mM Tris-HCl pH 7 . 5 

1 mM EDTA 



C) Various 

Dynabeads M-280 Streptavidin (10 mg/ml) 

0 

Magnetic particle concentrator -Dynal MPC 

A Thermal cycler (PTC-200 Peltier Thermal Cycler MJ Research) 

5 Ultrapure dNTP set, 2' -Deoxynucleoside 5' -Triphosphate 
(Pharmacia Biotech) 
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Taq DNA Polymerase (Gibco BRL) 
50 mM MgCl2 
5 0.1 M NaOH 

SeaPlaque Agarose (Flowgen Instruments Ltd) 

Protogel, 3 0% Acrylamide and 0.8% Bisacrylamide (National 
10 Diagnostics) 

Results: Identification of alleles bv CSA 

Seven allele specific reference sense strands were isolated 
15 from homozygous cell lines and hybridised to several anti- 
sense strands from other cell lines whose HLA specificity was 
defined by sequencing and in some cases only by serology. 

Alleles could be identified by homoduplex formation with a 
20 reference strand, which would migrate at the same rate as the 
double stranded DNA reference band (control band) in 
denaturing polyacrylamide gel. Non-identity would lead to 
heteroduplex formation and cathodic band(s) compared with the 
control band. 

25 

Under the denaturing conditions of 2 M Urea and 10% formamide 
at room temperature, it was observed that when the reference 
strand matched completely the anti-sense strand a single 
homoduplex band was visible in the gel. In cases when the 

3 0 reference strand differed by 3 or more bases from the anti- 
sense strand, a cathodic band corresponding to a heteroduplex 
was seen. This pattern was observed reproducibly for all 
the allele specific reference strands and is therefore 
independent of the position of the mismatch (es) on the 

3 5 strands and the specific base sequence of the allelic 
reference (Figures 2 and 3) . 



Under these conditions, the heteroduplex band containing 1 or 
2 base mismatches was indistinguishable from the homoduplex 
band. By altering the denaturing conditions to 3 M urea and 
15% formamide at 50°C, it was possible to distinguish between 
heteroduplexes with one or two mismatches from the homoduplex 
control band (Figure 4) . 

In further experiments, it was shown that heteroduplexes of 
the same alleles from different sources migrate to the same 
position on the gel. Figure 5 shows the results from such an 
experiment. Lanes 1, 5 and 8 contain duplexes comprising 
A0101 from different sources and the duplexes all migrated to 
the same position. Lanes 5 and 8 also contain duplexes 
comprising A0201 from different sources and these migrated at 
the same speed, but more slowly than the A0101 duplexes. 

EXAMPLE 2 

This Example shows how DNA fragments from a particular 
segment of a genome can be analysed and identified using a 
labelled reference strand without isolation of the duplex 
from the analysis phase. The Example uses specific reference 
strands for HLA-A, B and Cw alleles. 

The reference strands were prepared using sense primers for 
the three loci, which had fluorescent dye (Pharmacia Ltd) at 
the 5' -end. Separate PGR reactions were performed for each 
locus to produce the reference reagents. The primer 
sequences were as described above in Example 1 and the 
amplified reference strands included exons 2 and 3 of each 
locus. Lymphoblastoid, homozygous cell lines were used for 
each locus amplication : - 

HLA-A STEINLIN (10th IHW) 

HLA-B SP0010 (10th IHW) 

HLA-Cw STEINLIN (10th IHW) 



A*0101 
B*4402 
Cw*0701 
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DNA samples were prepared in a similar manner from other cell 
lines using non-biot inylated locus specific primers. 10 fil 
of sample solution and 10 fil of the relevant reference were 
mixed and hybridised (as- in item 2.5 of Example 1) and 
5 applied to 6% non -denaturing PAG (Long Ranger) . The 
electrophoretic analysis was performed in an ALFexpress DNA 
sequencer™ (Pharmacia Ltd) and the mobilities of the labelled 
duplexes were determined with an integral laser activated 
system with photodiode detectors. The results are shown in 
10 Figures 6 and 7 and in the following tables. Figure 6 
corresponds to tables la and lb and Figure 7 corresponds to 
tables to 2a and 2b. 

Table la 

15 





Name 


peaKw 


Run Time 


Peak Area 


1 


A16 


1 


55 :00 


3985 


2 


57:24 


77. 923 


3 


63 :38 


204 . 66 


2 


A17 


1 


54 :46 


4270 . 2 


3 


A18 


1 


54 :43 


3681.8 


2 


57 :36 


223 .44 


3 


63 :15 


245 .22 


4 


A19 


1 


54 :42 


4304 . 3 


2 


58 :06 


29 . 012 


3 


63 :25 


50 .298 


5 


A24 


1 


54 :44 


5991 . 7 


2 


59:49 


703 . 08 


6 


A3 0 


1 


54 :43 


4456 . 7 


2 


58 :13 


122 . 57 
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A3 5 



54 :45 



63 :38 



4181 .7 



855 . 39 



A3 6 



54 :49 



59:50 



4750 .6 



1274 .2 



A3 8 



54 :52 



59:47 



4517.5 



413 .31 



10 



11 



12 



A44 



A4 6 



A4 9 



54 :52 



57:50 



54:53 



63 :28 



54:55 



58 :41 



5738 



740 . 28 



4117.5 



2119 . 5 



5290 . 9 



606 . 82 



13 



14 



A53 



A54 



54 :55 



60 :12 



54 :56 



4543 . 8 



370 . 19 



4571. 5 



10 



15 



16 



A55 



B17 



54 : 56 



60 : 10 



63 : 01 



53 :25 



4330 . 5 



952 .01 



596 .82 



3498 . 2 



17 



B18 



53 :26 



3551.3 



18 



B19 



53 :26 



59 :53 



3591.5 



370 . 25 



19 



B20 



53 :28 



3528 . 1 



15 



20 



21 



B24 



B27 



53 :30 



53 :31 



3566 . 5 



3620 . 1 



1 


• 
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• 




Z Z 


B3 0 


1 


53 : 29 


3714 . 1 


*~> ~\ 




1 


53:27 


2024 . 8 






2 


62:06 


1871.2 


Z 4 


B3 2 


1 


53 : 27 


3918 . 5 


Z 3 


B3 / 


1 


53:28 


2580 . 1 






2 


5 8:56 


711 . 11 






3 


63 : 05 


689 . 07 


Z b 


H3 o 


1 


53:27 


2498 . 9 






2 


61 :48 


2160 . 5 


«i / 


B44 


1 


53:27 


2151 






2 


58 : 56 


2229 . 7 


2 8 


B4 9 


1 


53 :25 


4502 . 2 


z y 


B54 


1 


53 : 21 


2258 . 3 






2 


56 : 04 


907.41 






3 


59 : 16 


1205 . 8 


J U 


C16 


1 


52:20 


3921 . 2 






2 


63 : 04 


3312 . 1 


J 1 


C3 0 


1 


52 : 16 


3641 .4 






2 


62:06 


211.2 






3 


63:06 ' * 


2506 . 9 


3 2 


C44 


1 


52 : 13 


3087 . 2 






2 


61:23 


1556 . 5 






3 


62 : 10 


1725 . 1 


33 


C62 


1 


52 : 22 


6229 .4 








52:08 


4 52 0 . 8 
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5 







2 


61:30 


1629 .2 






3 


62 :21 


1407.1 


35 


C67 


1 


52 :06 


4188 . 2 






2 


62:39 


2970 . 1 






3 


63 :11 


2359.2 


36 


C68 


1 


52:09 


3469 . 6 






2 


62:07 


5392.5 


37 


C70 


1 


52 :12 


3474 . 3 






2 


63 :01 


1624 . 5 






3 


63 :20 


2068 . 1 


38 


C71 


1 


52 :14 


3246 .4 






2 


61:59 


834 .69 






3 


62:40 


755 .31 


39 


C75 


1 


52 :17 


3885 . 1 






2 


63 :45 


4930 .2 


40 


C80 


1 


52 :24 


3486.4 






2 


61:37 


5290 . 5 
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Table lb 





Lane 






Lane 






5 


No 


ID 


HLA type 


No 


ID 


HLA type 




1 . 


A16 


- A3, A24 


21 . 


B27 


- B38 




2 . 


A17 - 


- A2, All 


22 . 


B30 


- B*4002 


10 


3 . 


A18 - 


- A24, A31 


23 . 


B31 


- B*5701 




4 . 


A19 ■ 


- A24, A*2602 


24 . 


B32 


- B41 




5 . 


A24 ■ 


- A*3001 


25. 


B37 


- 6*1501, B*1520 




6 . 


A3 0 - 


- A*2902 


26 . 


B38 


- B18 




7 . 


A3 5 - 


- A*2501 


27. 


B44 


- B*5801, 


15 


8 . 


A3 6 - 


- A*3001 


28 . 


B49 


- B65 




9 . 


A3 8 - 


- A*3002 


29. 


B54 


- B35, B53 




10 . 


A44 - 


- A*0102, A*6601 


30. 


C16 


- Cw5 




11 . 


A4 6 - 


- A24 


31. 


C3 0 


- Cw*02022 




12 . 


A49 - 


- A33 


32 . 


C44 


- Cw*1701 


20 


13 . 


A53 - 


- A*0206, A24 


33 . 


C62 


- Cw*0702 




14 . 


A54 - 


- A*0202, A*0301 


34 . 


C63 


- Cw*03 04, Cwl4 




15. 


A55 - 


- A*0211, A*3201 


35. 


C67 


- Cw4, Cw*12 03 




16 . 


B17 - 


- 6*1501, B*3501 


36. 


C68 


- Cw*0602, Cw8 




17 . 


B18 - 


- B35, B52 


37. 


C70 


- Cw4 , Cw5 


25 


18 . 


B19 - 


- B51, B61 


38. 


C71 


- Cw*0704, Cwl5 




19 . 


B20 - 


- B14 


39. 


C75 


- Cw*1701 




20 . 


B24 - 


- B*1302 


40 . 


C80 


- Cw*1601 



30 

Table 2a 



Lane# 


Name 


Peak# 


Run Time 


Peak Area 


1 


A56 


1 


75:08 


9699 


2 


84:12 


498 .55 


3 


87:59 


901.42 


2 


A57 


1 


74 :47 


10417 


2 


83 :15 


1427 . 6 


3 


A62 


1 


74 : 51 


9838 . 4 


2 


83 : 09 


1621 . 6 


4 


A63 


1 


74 :53 


10018 


2 


80:43 


1759 . 1 


3 


82 :52 


923 . 9 


5 


A64 


1 


74 :53 


9311 . 3 


2 


82 :51 


3199 . 2 
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5 



10 





riu \j 


1 


74 : 53 


9733 . 5 






2 


80:57 


383 . 1 






3 


83 : 01 


172 .47 


/ 


A67 


1 


74 : 55 


10871 






z 


76:55 


1311 . 1 








83:27 


899.79 


Q 
O 


x\0 O 


1 
_L 


74 : 55 


9828 . 2 






Z 


81:08 


237 . 78 










681.24 


Q 


7V70 

i~i / VJ 


1 


74 : 59 


8811 






z 


ft 1 * 1 0 

O JL ■ J- VJ 


512 . 7 






*3 


82:56 


478 . 5 


10 


A71 


T 
J_ 


/ *± . J o 


9905.9 






Z 


/ -7 - VJ / 


277 .38 






-3 


83:30 


111 . 88 


11 


7\ n O 

A / Z 


-1 
X 


75 : 01 


10283 






z 


ft 1 • 1 0 


492.76 




/\ / ft 


_L 


75:01 


8528 . 5 






Z 




801.28 






*3 
j5 


ft 3 - 3 Q 


581.53 


13 


/\ / z> 




75:01 


9137 . 8 






z 


ft i * i n 


188.53 








83:17 


157 . 44 


14 


All 


1 


/ D IUj 


ft S78 9 






z 


77.19 
/ / . X z 


6 03 . 96 








Q 0 • 3 Q 

_7 VJ . J ^ 


419.15 


15 


A84 


1 


/ b : U U 


O / -? Z . -7 




2 


83 : 18 


97 .617 


16 


B56 


1 


72 :40 


2713 . 7 




2 


83 : 03 


. 2505.7 






3 


99 :10 


3217.4 


17 


B62 


1 


72 :37 


2553 




2 


88 : 26 


5435 . 7 
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5 



10 



18 


DO J 


L 


72:44 


2191.7 






9 


"7/1 .HQ 


4 84 . 13 








8 3:09 


2083 . 4 


19 




T 
X 


/ 2 : 4 6 


2819.8 






2 


81:25 


504 . 57 






J> 


104 : 4 9 


967.67 


2 0 


DO / 


1 
X 


72 : 4 5 


1914 . 3 






2 


82 : 18 


3043 . 9 


21 


R79 


_L 


72:42 


1835 






2 


88:01 


5588 . 4 


22 


D / *± 




72 : 4 5 


1906 . 9 






2 


80:48 


1435 . 1 






3 


88:08 


2536 


9 7 


T2 *7 C 


1 


72:41 


3216 . 6 






2 


84:36 


1407 . 2 


9 A 


rS / / 


1 


72 : 40 


3032 . 2 






2 


88:30 


1229. 






3 


90 : 00 


1313 . 9 


9 R 


DO 


1 


72:35 


1562 . 9 






2 


82:47 


3760 . 9 


2 D 


B84 


1 


72 : 33 


1539 






2 


80 : 12 


2321.8 






3 


85 : 55 


2364 .4 


9 *7 


"DOC 

Dob 


1 


72:27 


2068 .4 






2 


81 : 06 


1881 .4 






3 


82:38 


2372 . 2 




B86 


1 


72 : 18 


2087 . 1 






2 


o2 : 22 


4498 


29 


C16 


1 


70:42 


2188 






2 


89:03 


9578 . 6 


30 


C3 0 


1 


70 :42 


3485 .2 






2 


89 :43 


13029 


31 


C4 4 


1 


70 : 38 


2491 . 6 
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2 


o o : u ? 


5623.3 






*3 


87:53 


6152 . 7 


32 


C62 


1 




1552 . 1 






2 


/ _L . VJ J 


1762 . 3 


33 


C63 


J. 


70 - 24 


1977 






2 


o b : ±«5 








3 


op .no 


4273 .4 


34 


C67 


1 


7 0 : 2Z 


Z O J ft . -? 




2 


oo : lb 


CD f|A ft 






3 




6971 


35 


C6 8 


1 


/ 0 : o Z 


O *7 ft £ O 






2 


o / : 4 y 




36 


C70 


1 


/ U . ZD 


2557 7 

4& -J -J 1 • ' 




2 


o o : 4 o 


O .3 ft .O 






3 


O _7 . ft O 


7716 . 3 


37 


C71 


1 


70:2b 


O *7 cr Q O 




2 


87 : 14 


2613 .8 






3 


88 :21 


2685 . 5 


38 


C75 


1 


70 :25 


2259 .2 




2 


90:39 


10368 


39 


C80 


1 


70:23 


1811 




2 


86:22 


11660 
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Table 2b 



15 



25 



Lane 
No 


ID 


HLA type 


Lane 
No 


ID 


HLA type 


1 . 


A56 


- A*0205/A*3201 


21. 


B72 • 


- B62 


2 . 


A57 


- A*0207/A*0101 


22. 


B74 ■ 


- B58/B70 


3 . 


A62 


- A*0201 


23 . 


B75 - 


- B42 


4 . 


A63 


- A*0203/A*3301 


24 . 


B77 - 


- B*1502/B5502 


5 . 


A64 


- A*0204 


25 . 


B82 - 


- B8/B27 


6 . 


A66 - 


- A*0208/A*0101 


26 . 


B84 ■ 


- B17/B18 


7 . 


A67 • 


- A*0209/A*0301 


27 . 


B85 - 


- B51/B62 


8 . 


A68 ■ 


- A*0210/A*3001 


28 . 


B86 - 


- B*480l/B*6701 


9 . 


A70 - 


- A*0213/A*1101 


29 . 


C16 - 


- Cw5 


10 . 


A71 - 


- A*0216/A*0301 


30 . 


C3 0 - 


- Cw*02022 


11 . 


A72 - 


- A*0217 


31. 


C44 - 


- Cw*060l/Cw*302 


12 . 


A74 - 


- A2/A*8001 


32 . 


C62 - 


- Cw*0702 


13 . 


A75 - 


- A*300l/A*6802 


33 . 


C63 - 


- Cw*0304/Cwl4 


14 . 


A77 - 


- All/A*2403 


34 . 


C67 - 


- Cw4/Cw*2 03 


15 . 


A84 - 


- A30/A*6602 


35. 


C68 - 


- Cw*0602/Cw8 


16 . 


B56 - 


- B*490l/B60 


36. 


C70 - 


- Cw4/Cw5 


17 . 


B62 - 


- B*0702 


37. 


C71 - 


- Cw*0704/Cwl5 


18 . 


B63 - 


- B*4403/B*4001 


38. 


C75 - 


- Cw*1701 


19 . 


B66 - 


- B*080l/B*5001 


39. 


C80 - 


- Cw*1601 


20 . 


B67 - 


- B35/B39 
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CLAIMS 



1. A method for separating a DNA molecule from a mixture 
DNA molecules having the same number of nucleotides but 

fferent base sequences, which method comprises: 

(i) amplifying the DNA molecules in the mixture; 

(ii) hybridising single strands of the amplified DNA 
molecules with a complementary strand of a 
reference DNA molecule so as to form duplexes; 
and 

(iii) separating the duplexes. 

2 . A method according to claim 1 which comprises 

(i) amplifying the DNA molecules in the mixture 
employing a pair of primers in which one of the 
primers carries a ligand, so as to produce an 
amplified mixture of double -stranded DNA 
molecules in which one of the strands carries a 
ligand; 

(ii) contacting the amplified mixture of double- 
stranded DNA molecules with a receptor on a 
solid support under conditions such that the 
ligand binds to the receptor; 

(iii) separating the mixture of double -stranded DNA 
molecules into single-strands and removing the 
strands that are not bound to the support by the 
ligand; 



(iv) recovering the remaining strands from the 

support ; 



- 47 - 

(v) mixing the recovered strands with a 

complementary strand of a reference DNA molecule 
so as to form duplexes; and 

5 (vi) separating the duplexes. 

3 . A method according to claim 2 wherein the 
complementary strand of the reference DNA molecule is 
provided by 

10 

(i) amplifying the reference DNA molecule employing 
a pair of primers in which one of the primers 
carries a ligand, so as to produce amplified 
double -stranded reference DNA molecule in which 

15 one of the strands carries a ligand; 

(ii) contacting the double -stranded reference DNA 
molecule with a receptor on a solid support 
under conditions such that the ligand binds to 

2 0 the receptor; 

(iii) separating the double-stranded reference DNA 
molecule into single- strands and removing the 
strand that is not bound to the support by the 

25 ligand; and 

(iv) recovering the remaining strand from the 
support . 

30 4. A method according to claim 1 which comprises 

(i) amplifying the DNA molecules in the mixture 

employing a pair of primers in which one of the 
primers carries a high molecular weight 

3 5 molecule, so as to produce an amplified mixture 

of double -stranded DNA molecules in which one of 
the strands carries a high molecular weight 
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molecule ; 

(ii) separating the mixture of double -stranded DNA 
molecules into single strands ,- 

(iii) mixing the single strands with a complementary 
strand of a reference DNA molecule so as to form 
dup lexes; and 

(iv) separating the duplexes. 

5. A method according to claim 4 wherein the 

complementary strand of the reference DNA molecule is 
provided by 

(i) amplifying the reference DNA molecule employing 
a pair of primers in which one of the primers 
carries a high molecular weight molecule, so as 
to produce an amplified double -stranded 
reference DNA molecule in which one of the 
strands carries a high molecular weight 
molecule; and 

(ii) separating the double -stranded reference DNA 
molecule into single strands. 

6 . A method according to claim 1 which comprises 

(i) amplifying a single strand of each of the DNA 
molecules in the mixture ; 

(ii) mixing the amplified single strands with a 
complementary strand of a reference DNA molecule 
so as to form duplexes; and 



(iii) 



separating the duplexes. 
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7. A method according to claim 6 wherein the 
complementary strand of the reference DNA molecule is 
provided by amplifying a single strand of the reference DNA 
molecule . 

5 

8 . A method according to any one of the preceding claims 
wherein the reference DNA molecule has a known sequence. 

9 . A method according to any one of the preceding claims 
10 wherein the duplexes are separated by gel electrophoresis. 

10. A method according to claim 9 wherein the 
electrophoresis is performed under denaturing conditions. 

15 11. A method for identifying a DNA molecule in a mixture 

of DNA molecules having the same number of nucleotides but 
different base sequences, which method comprises separating 
the DNA molecules by a method as defined in claim 9 or 10, 
and comparing the positions of the separated duplexes on the 

20 gel with the position of a control DNA molecule. 

12 . A method for identifying a DNA molecule in a mixture 
of DNA molecules having the same number of nucleotides but 
different base sequences, which method comprises separating 

25 the DNA molecules by a method as defined in any one of claims 
1 to 10 and sequencing each of the separated DNA molecules, 
carrying out sequence specific primer (SSP) amplification 
analysis or carrying out sequence specific oligonucleotide 
(SSO) analysis. 

30 

13 . A method according to any one of the preceding claims 
wherein the mixture of DNA molecules is a mixture of alleles 
of a polyallelic gene. 

35 14 . A method according to claim 13 wherein a reference 

allele is used which has the same serotype as at least one of 
the alleles in the mixture of alleles. 
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15 . A method according to claim 13 or 14 wherein the 
mixture of alleles is from a prospective recipient or a 
prospective donor in a tissue or organ transplant operation. 

5 16. A method for determining whether a prospective 

recipient in a tissue or organ transplant operation has 
alleles of a gene that are compatible with the alleles of a 
prospective donor in the operation, which method comprises 

10 (i) amplifying the alleles of the prospective 

recipient employing a pair of primers in which 
one of the primers carries a ligand, so as to 
produce amplified double -stranded alleles of the 
prospective recipient in which one of the 

15 strands carries a ligand; 

(ii) contacting the amplified double -stranded alleles 
with a receptor on a solid support under 
conditions such that the ligand binds to the 

20 receptor; 

(iii) separating the double- stranded alleles into 
single -strands and removing the strands that are 
not bound to the support by the ligand; 



25 



30 



(iv) recovering the remaining strands from the 
support ; 

(v) mixing the recovered strands with complementary 
strands of the alleles of the prospective donor 
so as to form test duplexes; 



(vi) separating the test duplexes by gel 

electrophoresis; and carrying out one or more of 
35 the following steps: 



(vii) comparing the positions to which the test 



duplexe migrate on the gel with the position of 
a control DNA molecule; 



(viii) sequencing the test duplexes; 

(ix) sequence specific primer (SSP) amplification 
ana lysis; and 

(x) sequence specific oligonucleotide (SSO) 
analysis . 

17. A method according to claim 16 wherein the 

complementary strands of the alleles of the 
prospective donor are provided by 

(i) amplifying the alleles of the prospective donor 
employing a pair of primers in which one of the 
primers carries a ligand, so as to produce 
amplified double -stranded alleles of the 
prospective donor in which one of the strands 
carries a ligand; 

(ii) contacting the amplified double -stranded alleles 
with a receptor on a solid support under 
conditions such that the ligand binds to the 
receptor; 

(iii) separating the double-stranded alleles into 
single -strands and removing the strands that are 
not bound to the support by the ligand; and 

(iv) recovering the remaining strands from the 
support . 



18 . A method according to claim 16 or 17 wherein the 
rospective donor is selected to have alleles of the same 
erotype as the prospective recipient. 
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19. A method according to any one of claims 16 to 18 
wherein the control DNA is a homoduplex between two strands 
of the same allele and migration of the test duplexes to the 
same position on the gel as the homoduplex indicates that the 

5 prospective recipient and the prospective donor have the same 
alleles . 

20. A method according to any one of claims 13 to 19 
wherein the alleles are of a human leucocyte antigen (HLA) 

10 class I gene or an HLA class II gene. 

21. A method according to claim 20 wherein the alleles 
are of the HLA -A gene, the HLA-B gene or the HLA-C gene. 

15 22. A method according to any one of claims 2, 3 and 9 to 

21 wherein the ligand is biotin and the receptor is 
streptavidin . 

23. A method according to any one of claims 2, 3 and 9 to 
20 22 wherein the solid support is magnetic beads, and the 

strands which do not carry a ligand are removed by attracting 
the beads to a magnet and washing the beads under conditions 
such that the double- stranded DNA molecules dissociate into 
single strands. 

25 

24. A method according to any one of the preceding claims 
wherein amplification of the DNA molecules is performed by 
polymerase chain reaction (PGR) . 

30 25. A method according to any one of claims 15 to 24 

wherein the prospective recipient and the prospective donor 
are a prospective bone marrow recipient and a prospective 
bone marrow donor, or a prospective kidney recipient and a 
prospective kidney donor. 

35 

26 . A method for identifying a DNA molecule, which method 
comprises : 



(i) contacting the DNA molecule with a labelled 
reference DNA strand under conditions such that 
the reference strand hybridizes to a 
complementary strand of the DNA molecule so as 
to form a test duplex; 

(ii) running the test duplex and one or more control 
duplex (es) in a gel by electrophresis ; and 

(iii) comparing the position of the test duplex on the 
gel with the position (s) of the control 
duplex (es) . 

27. A method according to claim 26 wherein the control 
duplexes are either (a) duplexes which have faster and/or 
slower mobility than the test duplex and which are run in the 
same lane on the gel as the test duplex or (b) duplexes which 
have graded mobilities and which are run in a different lane 
on the gel to the test duplex. 

28. A method according to claim 26 or 27 wherein the 
reference strand is labelled with a fluorescent label or a 
label suitable for attachment of an enzyme. 

29. A method according to any one of claims 26 to 28 
wherein more than one labelled reference strand is used. 



ABSTRACT 



METHODS FOR SEPARATING AND IDENTIFYING DNA MOLECULES 

The invention provides a method for separating a DNA 
molecule from a mixture of DNA molecules having the same 
number of nucleotides but different base sequences, which 
method comprises : 

(i) amplifying the DNA molecules in the mixture; 

(ii) hybridising single strands of the amplified DNA 
molecules with a complementary strand of a 
reference DNA molecule so as to form duplexes; 
and 

(iii) separating the duplexes. 

The different DNA molecules in the original mixture give 
rise to duplexes having different numbers, positions or 
types of mismatches. This allows the duplexes to be 
separated by, for example, gel electrophoresis. The 
separated duplexes can then be analysed to identify the DNA 
molecules that were present in the original mixture. 
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